Picture for Lidong Bing

Lidong Bing

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

Add code
May 21, 2026
Viaarxiv icon

MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome

Add code
Mar 30, 2026
Viaarxiv icon

MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier

Add code
Mar 04, 2026
Viaarxiv icon

LongRLVR: Long-Context Reinforcement Learning Requires Verifiable Context Rewards

Add code
Mar 02, 2026
Viaarxiv icon

MiroFlow: Towards High-Performance and Robust Open-Source Agent Framework for General Deep Research Tasks

Add code
Feb 26, 2026
Viaarxiv icon

Document Reconstruction Unlocks Scalable Long-Context RLVR

Add code
Feb 09, 2026
Viaarxiv icon

Self-Rewarding Sequential Monte Carlo for Masked Diffusion Language Models

Add code
Feb 02, 2026
Viaarxiv icon

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Add code
Jan 14, 2026
Viaarxiv icon

EverMemOS: A Self-Organizing Memory Operating System for Structured Long-Horizon Reasoning

Add code
Jan 05, 2026
Viaarxiv icon

On the Role of Discreteness in Diffusion LLMs

Add code
Dec 27, 2025
Viaarxiv icon